Optimizing F-Measures: A Tale of Two Approaches

Authors

  • Nan Ye
  • Kian Ming Adam Chai
  • Wee Sun Lee
  • Hai Leong Chieu
Abstract

F-measures are popular performance metrics, particularly for tasks with imbalanced data sets. Algorithms for learning to maximize F-measures follow two approaches: the empirical utility maximization (EUM) approach learns a classifier having optimal performance on training data, while the decision-theoretic approach learns a probabilistic model and then predicts labels with maximum expected F-measure. In this paper, we investigate the theoretical justifications and connections for these two approaches, and we study the conditions under which one approach is preferable to the other using synthetic and real datasets. Our results suggest that, given accurate models, the two approaches are asymptotically equivalent on large training and test sets. Nevertheless, empirically, the EUM approach appears to be more robust against model misspecification, and given a good model, the decision-theoretic approach appears to be better for handling rare classes and a common domain adaptation scenario.
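The contrast between the two approaches can be made concrete with a small sketch. It is illustrative only and is not the authors' implementation. The function names (`eum_threshold`, `decision_theoretic_predict`), the use of F1 rather than a general F-measure, the convention that F1 equals 1 when there are no true or predicted positives, the independence of labels, and the restriction of the search to "label the k most probable instances positive" are all assumptions of this sketch. Expected F1 is computed by brute-force enumeration, so it is only practical for a handful of instances.

```python
from itertools import product


def f1(y_true, y_pred):
    """F1 score for two 0/1 sequences of equal length."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    denom = sum(y_true) + sum(y_pred)
    # Assumed convention: F1 = 1 when nothing is positive and nothing is predicted.
    return 1.0 if denom == 0 else 2.0 * tp / denom


def eum_threshold(scores, labels):
    """EUM-style step: pick the score threshold with the best F1 on training data."""
    best_t, best_f = None, -1.0
    for t in sorted(set(scores)):
        preds = [1 if s >= t else 0 for s in scores]
        f = f1(labels, preds)
        if f > best_f:
            best_t, best_f = t, f
    return best_t, best_f


def expected_f1(probs, y_pred):
    """Exact expected F1, assuming independent labels with P(y_i = 1) = probs[i].

    Enumerates all 2^n label vectors, so use it only for small n.
    """
    total = 0.0
    for y in product([0, 1], repeat=len(probs)):
        weight = 1.0
        for p, yi in zip(probs, y):
            weight *= p if yi == 1 else 1.0 - p
        total += weight * f1(y, y_pred)
    return total


def decision_theoretic_predict(probs):
    """Decision-theoretic step: among top-k labelings, maximize expected F1."""
    order = sorted(range(len(probs)), key=lambda i: -probs[i])
    best_pred, best_val = None, -1.0
    for k in range(len(probs) + 1):
        pred = [0] * len(probs)
        for i in order[:k]:
            pred[i] = 1
        val = expected_f1(probs, pred)
        if val > best_val:
            best_pred, best_val = pred, val
    return best_pred, best_val
```

With two test instances that each have probability 0.4 of being positive, `decision_theoretic_predict([0.4, 0.4])` labels both as positive, even though a 0.5 cutoff on the probabilities would label neither. This shows how maximizing expected F-measure can differ from thresholding individual probabilities.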


Similar resources


Optimizing Non-decomposable Performance Measures: A Tale of Two Classes

Modern classification problems frequently present mild to severe label imbalance as well as specific requirements on classification characteristics, and require optimizing performance measures that are non-decomposable over the dataset, such as the F-measure. Such measures have spurred much interest and pose specific challenges to learning algorithms since their non-additive nature precludes a dire...


Approaches to Improving Self-Efficacy among 8th Graders: Solution-Focused and Strategic Therapies

A. Bahraami, F. Bahaari, Ph.D. To compare the effectiveness of two approaches to improving self-efficacy, a specially selected sample of 45 eighth graders was randomly assigned to three groups (control and experimental). The experimental groups were taught for ten sessions o...


Modeling Dynamic Production Systems with Network Structure

This paper deals with the problem of optimizing two-stage decision making units (DMUs), where the activity and performance of a two-stage DMU in one period affect its efficiency in the next period. To evaluate such systems, the effect of activities in one period on those in the next period must be considered. To do so, we propose a dynamic DEA approach to measure the performance of su...


An Architectural Tale of the Two Cities

A comparative study of the corresponding styles of Western and Iranian modern architecture has hardly ever been carried out in detail. This paper aims to sketch out an outline for such an investigation and to present a summary of empirical evidence, accompanied by field observations, to elaborate the ongoing relationship between architectural styles in Iran and those of the West. This is ...



Journal title:
  • CoRR

Volume abs/1206.4625

Publication date 2012